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ABSTRACT 


Gary  Knott  has  presented  algorithms  for  computing  a bijection 
between  the  set  of  binary  trees  on  n nodes  and  an  initial  segment 
of  the  positive  integers.  Rotem  and  Varol  presented  a more  com- 
plicated algorithm  that  computes  a different  bijection,  claiming 
that  their  algorithm  is  more  efficient  and  has  advantages  if  a 
sequence  of  several  consecutive  trees  is  required.  We  present  a 
modification  of  Knott's  algorithm  that  is  simpler  than  Knott's 
and  as  efficient  as  Rotem  and  Varol' s.  We  also  give  a new 
linear-time  algorithm  for  transforming  a tree  into  its  successor 
in  the  natural  ordering  of  binary  trees. 
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significance  and  Explanation 


! 


We  illustrate  the  mathematical  problem  discussed  in  this  paper  in  terms 
of  one  possible  application,  the  storage  and  retrieval  of  information  in 
digital  computers  by  techniques  that  depend  on  a series  of  yes/no  questions. 
The  binary  tree  is  a structure  that  makes  it  easy  to  put  information  into, 
and  take  information  out  of,  a data  base,  using  such  a sequence  of  questions. 

Given  a certain  amount  of  information,  there  will  be  many  different 
ways  of  representing  this  in  the  form  of  binary  trees.  The  procedure  de- 
scribed in  this  report  gives  all  possible  ways  of  storing  this  information 
in  binary  trees.  Given  one  representation,  it  is  possible  to  find  the  next 
one  in  line.  Any  representation  in  the  list  can  be  generated  without  going 
through  all  of  them. 
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The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive  summary 
lies  with  MRC,  and  not  with  the  authors  of  this  report. 


A NOTE  ON  ENUMERATING  BINARY  TREES 
Marvin  Solomon  and  Raphael  A.  Finkel 

Introduction 

Gary  Knott  has  published  algorithms  for  computing  a bijection 
Rank  from  the  set  of  binary  trees  with  n nodes  to  an  initial  seg- 
ment of  the  integers  and  for  computing  Rank  ^ [1].  His  method 
for  computing  Rank~^  involves  generating  certain  permutations  of 
{1,  2,  n}  called  tree  permutations^  which  are  in  one  to  one 

correspondence  with  the  binary  trees  and  from  which  the  binary 
trees  may  be  easily  constructed.  Rotem  and  Varol  propose  an  al- 
ternative technique  for  generating  the  trees  [2].  Instead  of 
constructing  the  tree  permutations  (which  they  call 
stack-sortable  permutations)  directly,  they  construct  the 
inversion  tables  of  these  permutations,  which  are  sequences  of 
non-negative  integers  called  ballot  sequences.  They  show  that 
the  ballot  sequences  may  be  generated  in  lexicographic  order  and 
present  a clever  and  efficient  algorithm  for  converting  the  bal- 
lot sequences  into  trees.  In  this  note,  we  present  modifications 
of  Knott's  algorithms  that  map  d irectly  between  trees  and  in- 
tegers. In  addition,  we  correct  some  misconceptions  put  forth  by 
Rotem  and  Varol,  and  present  a new  algorithm  that  transforms  a 
given  tree  to  the  next  one  in  sequence. 

In  comparing  their  technique  to  Knott's,  Rotem  and  Varol  con- 
cede that  their  method  does  not  generate  the  trees  in  the  "natur- 
al" order,  but  claim  two  advantages;  (1)  They  say  their  tech- 
nique is  more  efficient,  since  it  is  "well-known"  that  the  map- 
ping  of  permutations  to  trees  requires  0(n  ) operations  in  the 
worst  case  for  trees  with  n nodes.  In  contrast,  their  method  re- 
quires only  0(n  log  n)  operations  to  create  an  n-node  tree,  pro- 

Sponsored  by  the  United  States  Army  under  Contract  No.  DAAG29-75-C-0024.  Partial 
support  by  the  Research  Committee  of  the  Graduate  School  of  the  University  of 
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vided  a table  of  n values  has  been  precomputed.  They  also  imply 
a similar  advantage  for  their  calculation  of  Rank.  (2)  They 
point  out  that  a ballot  sequence  may  be  efficiently  generated 
from  the  previous  one  and  converted  to  a tree,  yielding  a tech- 
nique for  generating  a sequence  of  trees.  They  claim  that  using 
Knott's  method,  "to  generate  k trees  corresponding  to  consecutive 
permutations  would  require  the  transformation  of  k indices,  since 
there  is  no  simple  way  of  deriving  stack-sortable  permutations  in 
their  order  corresponding  to  the  natural  order  of  trees"  [1,  p. 
404]. 

With  regard  to  the  first  claim,  we  point  out  that  whereas  the 

mapping  of  an  arbitrary  permutation  to  the  corresponding  tree  may 
2 

take  0(n  ) operations,  a tree  permutation  may  be  converted  to  a 
tree  in  0(n)  operations,  due  to  its  special  properties.  Rather 
than  prove  this  result  here,  however,  we  present  a modification 
of  Knott's  algorithm  that  translates  directly  from  indices  to 
trees. 

With  regard  to  the  generation  of  sequences  of  trees,  we  show 
how  to  transform  a tree  to  the  next  one  in  the  natural  order  by 
an  algorithm  that  %«orks  directly  on  the  tree  and  requires  0(n) 
operations. 


Definitions 

A binary  tree  T is  either  a nul 1 tree  or  consists  of  a node 
called  the  root  and  two  binary  trees  denoted  Lef t (T)  and 
Right (T) . In  the  former  case,  the  size  of  T is  zero;  in  the 
latter  case  Size(T)  = 1 + Si ze (Lef t (T) ) + Si ze (Right (T) ) . We 
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will  often  identify  a tree  with  its  root. 

Define  a relation  on  trees  by  if  and  only  if  one  of 

the  following  conditions  holds: 

Size{Tj)  < Size(T2) 

or  Size(T^)  = Size(T2)  and  Left  (T^) -<  Left  (T2) 

or  Size(T.)  = Size(T-)  and 

Left(T£)  » LeftCTp  and  Right  Right  (T2)  . 

This  ordering  is  called  the  natural  ordering  of  trees.  Let 

T-,,  ...r  T_  } be  the  sequence  of  all  trees  of  size  n,  ordered  by 

“n 

the  natural  ordering.  It  is  well  known  that  is  the  nth 

Catalan  number  [3]:  » ^^n^n^l*  Define  Rank(T^)  » i.  Let 

First(n)  denote  the  tree  depicted  in  Figure  1(a)  . The  predi- 
cate Last(T)  is  true  if  and  only  if  T = T„  , depicted  in  Figure 
®n 

Kb)  . 


The  Algorithms 

In  the  Pascal  [4]  program  presented  in  the  appendix,  a tree  T 
is  represented  by  a pointer  to  a structure  containing  the  two 
trees  Left(T)  and  Right(T)  as  well  as  Size(T).  The  size  is  pro- 
vided for  efficient  implementation  of  Rank  and  Mext. 

The  main  program  computes  a table  of  Catalan  numbers  using 

6B  -, 

the  recurrence  cited  by  Knott:  B^  » 4B^  , - . It  also  com- 

n + 1 

putes  a table  of  other  values  which  are  used  to  speed  up  the  cal- 
culation of  Rank  and  Rankinverse  as  described  below. 

The  procedure  Next  attempts  to  transform  T to  the  successor 
of  T in  the  natural  ordering.  Resul t is  set  to  true  if  Next 
succeeds  or  to  false  if  Last (T)  is  true.  The  method  comes 


mm. 


wmm 


directly  from  the  definition  of  the  natural  order.  The  successor 
of  T may  be  formed  by  attempting  first  to  transform  T’s  right 
subtree  to  its  successor.  If  T's  right  subtree  has  no  successor, 
then  it  is  reset  to  the  first  tree  of  its  size  and  the  left  sub- 
tree is  transformed  to  its  successor.  If  both  the  subtrees  are 
the  last  trees  of  their  sizes,  then  one  node  is  moved  from  the 
right  to  the  left  and  both  subtrees  are  re- initial ized . 

Let  T be  a tree  and  let  n » Size(T).  A top-level  call  of 
Next  gives  rise  to  at  most  one  recursive  call  for  each  node  of  T, 
plus  at  most  one  call  for  each  of  the  n - I null  trees  at  the 
leaves.  The  amount  of  work  in  one  call  to  Next,  exclusive  of  em- 
bedded calls  to  Next  and  to  Fi rst  is  bounded  by  a constant.  Each 
call  to  Fi rst  allocates  a node  of  the  new  tree,  so  the  total  work 
involved  in  calls  to  Fi rst  is  bounded  by  n.  Thus  Next  (T)  re- 
quires at  most  0(n)  time,  where  n » Size(T). 

The  procedure  Rankinverse  constructs  the  tree  whose  rank  is  i 
by  essentially  the  same  counting  argument  as  the  one  used  by 
Knott.  Let  denote  the  number  of  trees  with  n nodes  whose 

left  subtree  has  k nodes. norder . As  Knott  points  out, 

9 

rhen  the  size  of  the  left  subtree  of  Tj^  is  the 

largest  Integer  r such  that  Gj^^  < i.  As  Rotem  and 

k*  r 

Varol  point  out,  this  value  may  be  calculated  quickly  by  precom- 
puting some  values  of  and  using  binary  search  to  find  the 

largest  less  than  i.  The  complexity  of  finding  the  root  is 
then  O(log  r)  < 0{log  n)  . The  rest  of  Rankinverse  calculates  the 
ranks  of  the  left  and  right  subtrees  as  /(i  - S,^)/k|+  1 and 

^ ^ “ ^rn)  k I , 


respectively,  where  k is  the  size  of  the 


right  subtree.  Therefore,  the  time  devoted  to  one  call  of 
Rankinverse  can  be  bounded  by  O(log  n) , and  since  each  call  of 
Rankinverse  generates  one  node  of  the  resulting  tree,  the  total 
time  is  0 (n  log  n)  . 

The  procedure  Rank  (T)  works  by  counting  the  number  of  trees 
preceeding  T in  the  natural  ordering.  It  divides  them  into  three 
classes:  those  T'  for  which  Si2e(Left(T' ) ) < Si ze (Lef t (T) ) , 

those  for  which  Size{Left{T' ) ) * Size (Right (T) ) but 

Left(T')  Left(T),  and  those  for  which  Left(T')  * Left(T)  but 

Right(T')  Right(T).  The  size  of  the  first  class  is  just 
where  r is  the  inorder  number  of  the  root,  and  the  other  sizes 
can  be  calculated  in  a constant  amount  of  time,  exclusive  of  re- 
cursive calls.  The  procedure  Rank  is  called  once  at  each  node  of 
the  tree.  Hence,  if  the  S^^^  numbers  are  precomputed,  the  running 
time  of  Rank  is  bounded  by  0(n) . 


Conclusions 

We  have  presented  straightforward  and  efficient  algorithms 
for  computing  the  rank  of  a tree  in  the-  natural  ordering  of. 
binary  trees  of  a given  size  and  for  constructing  the  tree  with  a 
given  rank.  We  have  also  presented  a new  linear  algorithm  that 
transforms  a tree  to  its  successor  in  the  natural  ordering. 
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(a)  (b) 

Figure  1 

The  first  and  last  trees  with  n nodes 


type  tree  = ^ node; 
node  = 
record 

left,  right  : 
size  : integer 
end  ; 


(*  number  of  nodes  in  the  tree  *) 


n,  k : integer; 

array  [0  ..  MAX]  of  integer;  (*  Catalan  numbers 
array  [0  ..  MAX,  0 ..  MAX]  of  integer; 

(*  Sums  of  products  of  Catalan  numbers  *) 


var 


procedure  Initialize; 

(*  Precompute  the  tables  B and  S *) 
beg  in 

B[0]  :=  1; 

for  n ;=  1 to  MAX  do 
begin 

B[n]  ;=  4 * B[n-1]  - (6  * B[n-l])  div  (n  + 1) 

S [0 , n]  : = 0 ; 

for  k ;=  0 to  n - I do 

S[k+l,n]  :=  S[k,n]  + B [k]  * B[n-k-l]; 

end ; 

end  (*  procedure  Initialize  *) ; 


function  First(n  ; integer)  ; tree; 

(*  Generate  the  first  tree  having  h nodes  *) 
var  Resul t : tree  ; 
beg  in 

if  n = 0 then  First  ;=  nil 
else  begin 

new(Result)  ; 
with  Resul t't  do 
beg  in 

size  : = n ; 
left  : = nil  ; 
right  :=  First(n  - 1); 
end  ; 

First  ; = Resul t ; 
end  ; 

end  (*  Fi rst  *) ; 


procedure  Next(T  : tree;  var  Result  : Boolean); 

(*  Change  T to  the  next  (in  the  canonical  ordering)  tree 
if  possible.  Report  success  or  failure  in  Result.  *) 
label  99; 
var 

Ok  : Boolean; 

Rsize  : integer; 
beg  in 

if  T = nil  then  Result  :=  false 

else  with  Tf  do 

begin 

Next(right,  Ok)  ; 

if  right  = nil  then  Rsize  :=  0 else  Rsize  :=  rights. size; 
if  not  Ok  then 
beg  in 

Next(left,  Ok)  ; 
if  not  Ok  then 
beg  in 

Rsize  :=  Rsize  - 1 ; 
if  Rsize  < 0 then 
beg  in 

Result  :=  false; 
goto  99;  (*  return  *) 
end  ; 

left  ;=  First(size  - Rsize  - 1); 
end  ; 

right  :=  First(Rsize)  ; 
end  ; 

Result  :=  true; 
end  ; 

99: 

end  (*  Next  *)  ; 


function  Rank(T  : tree)  ; integer; 

var  Lsize,  Lrank,  Rsize,  Rrank  : integer; 

beg  in 

if  T » nil  then  Rank  :=  1 

else  with  do 

begin 

if  left  » nil  then  Lsize  :=  0 else  Lsize  :=  left'T.size  ; 
Lrank  :«  Rank(left) ; 

if  right  » nil  then  Rsize  :=  0 else  Rsize  ;=  rights. size; 
Rrank  :■  Rank(right); 

Rank  ;■  B [Rsize]  * (Lrank  - 1) 

+ Rrank 

> S [Lsize , size]  ; 

end ; 


function  Rankinverse ( i , n ; integer)  : tree; 

(*  Return  the  tree  whose  rank  is  i among  those  with  n nodes  *) 
var 

Low,  High,  Center  : integer;  (*  For  binary  search  *) 

Lsize,  Rsize  ; integer; 

Result  ; tree; 
beg  in 

if  n = 0 thf.n  Rankinverse  ;=  nil 
else  begin 

(*  Set  High  = max  { k - S[k,n]  < i } using  binary  search.  *) 
Low  0 ; 

High  ;=  n - 1 ; 
repeat 

Center  ;=  (Low  + High)  div  2; 
if  i > S (Center,  n]  then  Low  :=  Center  + 1 
else  High  :*  Center  - 1 ; 
until  Low  > High ; 

Lsize  :=  High ; 

Rsize  ;=  n - Lsize  - 1 ; 
i :»  i - S[Lsize,  n]  - 1 ; 
new(Result)  ; 
with  Result''  do 
beg  in 

left  ;=  Rankinverse ( i div  B[Rsize]  + 1,  Lsize); 
right  :=  Rankinverse ( i mod  B[Rsizel  +1,  Rsize) ; 
size  n; 
end  ; 

Rankinverse  :=  Result; 
end  ; 
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